Multiple Protein Structure Alignment by Deterministic Annealing
نویسنده
چکیده
Although many algorithms have been developed so far for structure alignment problem based on either distancebased methods or vector-based methods, such as iterative dynamical programming, Monte Carlo simulation, graph theory, and mean field equations, only a few methods are published for multiple structure alignment, such as progressive structure alignment and Monte Carlo optimization. In this paper, we propose a novel method for solving multiple structure alignment problem, based on mean field annealing technique. We define the structure alignment as a mixed integer-programming (MIP) problem with the inter-atomic distances between two or more structures as an objective function[1]. The integer variables represent the marchings among structures whereas the continuous variables are translation vectors and rotation matrices with each protein structure as a rigid body. By exploiting the special structure of continuous partial problem, we transform the MIP into a nonlinear optimization problem (NOP) with a nonlinear objective function and linear constraints, based on mean field equations. To optimize the NOP, a mean field annealing procedure is adopted with a modified Potts spin model[2]. Since all linear constraints are embedded in the mean field equations, we do not need to add any penalty terms of the constraints to the error function. In other words, there is no ”soft constraint” in our mean field model and all constraints are automatically satisfied during the annealing process, thereby not only making the optimization more efficiently but also eliminating unnecessary parameters of penalty that usually require careful tuning dependent on the problems. 2 Pairwise Structure Alignment
منابع مشابه
Protein structure alignment by deterministic annealing
MOTIVATION Protein structure alignment is one of the most important computational problems in molecular biology and plays a key role in protein structure prediction, fold family classification, motif finding, phylogenetic tree reconstruction and so on. From the viewpoint of computational complexity, a pairwise structure alignment is also a NP-hard problem, in contrast to the polynomial time alg...
متن کاملMASTR: multiple alignment and structure prediction of non-coding RNAs using simulated annealing
MOTIVATION As more non-coding RNAs are discovered, the importance of methods for RNA analysis increases. Since the structure of ncRNA is intimately tied to the function of the molecule, programs for RNA structure prediction are necessary tools in this growing field of research. Furthermore, it is known that RNA structure is often evolutionarily more conserved than sequence. However, few existin...
متن کاملA Fuzzy Matching Approach to Multiple Structure Alignment of Proteins
Motivation: Comparing protein structures is important for understanding the relationships between sequence, structure and function. With the increasing number of experimentally determined protein structures, an automated algorithm that can perform structure alignment of many proteins is therefore highly advantageous. Results: We present a novel approach for multiple structure alignment of prote...
متن کاملMultiple alignment by sequence annealing
MOTIVATION We introduce a novel approach to multiple alignment that is based on an algorithm for rapidly checking whether single matches are consistent with a partial multiple alignment. This leads to a sequence annealing algorithm, which is an incremental method for building multiple sequence alignments one match at a time. Our approach improves significantly on the standard progressive alignm...
متن کاملMultiple structural alignment for distantly related all beta structures using TOPS pattern discovery and simulated annealing.
Topsalign is a method that will structurally align diverse protein structures, for example, structural alignment of protein superfolds. All proteins within a superfold share the same fold but often have very low sequence identity and different biological and biochemical functions. There is often significant structural diversity around the common scaffold of secondary structure elements of the f...
متن کامل